Stochastic Optimization from Distributed, Streaming Data in Rate-limited Networks

نویسندگان

  • Matthew S. Nokleby
  • Waheed Uz Zaman Bajwa
چکیده

Motivated by machine learning applications in networks of sensors, internet-of-things (IoT) devices, and autonomous agents, we propose techniques for distributed stochastic convex learning from high-rate data streams. The setup involves a network of nodes—each one of which has a stream of data arriving at a constant rate—that solve a stochastic convex optimization problem by collaborating with each other over ratelimited communication links. To this end, we present and analyze two algorithms—termed distributed stochastic approximation mirror descent (D-SAMD) and accelerated distributed stochastic approximation mirror descent (AD-SAMD)—that are based on two stochastic variants of mirror descent. The main collaborative step in the proposed algorithms is approximate averaging of the local, noisy subgradients using distributed consensus. While distributed consensus is well suited for collaborative learning, its use in our setup results in perturbed subgradient averages due to rate-limited links, which may slow down or prevent convergence. Our main contributions in this regard are: (i) bounds on the convergence rates of D-SAMD and AD-SAMD in terms of the number of nodes, network topology, and ratio of the data streaming and communication rates, and (ii) sufficient conditions for order-optimum convergence of D-SAMD and ADSAMD. In particular, we show that there exist regimes under which AD-SAMD, when compared to D-SAMD, achieves orderoptimum convergence with slower communications rates. This is in contrast to the centralized setting in which use of accelerated mirror descent results in a modest improvement over regular mirror descent for stochastic composite optimization. Finally, we demonstrate the effectiveness of the proposed algorithms using numerical experiments.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optimal Rate Allocation in Overlay Content Distribution

This paper addresses the optimal rate allocation problem in overlay content distribution for efficient utilization of limited bandwidths. We systematically present a series of optimal rate allocation strategies by dividing our discussions into four typical scenarios. Based on application-specific requirements, these scenarios reflect the contrast between elastic and streaming content distributi...

متن کامل

Diffusion Adaptation over Networks

Adaptive networks are well-suited to perform decentralized information processing and optimization tasks and to model various types of self-organized and complex behavior encountered in nature. Adaptive networks consist of a collection of agents with processing and learning abilities. The agents are linked together through a connection topology, and they cooperate with each other through local ...

متن کامل

Optimal Allocation of Distributed Generation in Microgrid by Considering Load Modeling

Recent increment in carbon emission due to the dependency on fossil fuels in power generation sector is a critical issue in the last decade. The motivation to Distributed Generation (DG) in order to catch low carbon networks is rising. This research seeks to experience DG existence in local energy servicing in microgrid structure. Optimal sizing and placement of DG units is followed by this pap...

متن کامل

Optimal Relief Order Quantity under Stochastic Demand and Lead-time

In this paper, a newsboy model is developed under uniformly distributed lead-time and demand that is an appropriate assumption in obtaining optimal relief inventory of humanitarian disasters. It is noteworthy that limited historical data are in hand on relief operations. Hence, analytical and approximate solutions for optimal relief order quan­tity were derived. The effect of lead-time uncertai...

متن کامل

Design and Test of the Real-time Text mining dashboard for Twitter

One of today's major research trends in the field of information systems is the discovery of implicit knowledge hidden in dataset that is currently being produced at high speed, large volumes and with a wide variety of formats. Data with such features is called big data. Extracting, processing, and visualizing the huge amount of data, today has become one of the concerns of data science scholar...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1704.07888  شماره 

صفحات  -

تاریخ انتشار 2017